A Parallel FP-Growth Mining Algorithm with Load Balancing Constraints for Traffic Crash Data

نویسندگان

چکیده

Traffic safety is an important part of the roadway in sustainable development. Freeway traffic crashes typically cause serious casualties and property losses, being a threat to public safety. Figuring out potential correlation between various risk factors revealing their coupling mechanisms are effective ways explore identity freeway crash causes. However, existing association rule mining algorithms still have some limitations both efficiency accuracy. Based on this consideration, using data obtained from WDOT (Washington Department Transportation), research constructed multi-dimensional multilevel system for analysis. Considering load balancing, FP-Growth (Frequent Pattern- Growth) algorithm was optimized parallelly based Hadoop platform, achieve efficient accurate calculation massive amounts data; then, according results mechanism among precursors, causes were identified revealed. The show that parallel FPgrowth with balancing constraints has better operating speed than conventional FP-growth towards processing big data. This improved makes full use cluster resources more suitable large sets while retaining original advantages algorithm. In addition, rules model improvement interaction proposed can catch occurrence consequences (lower support degree probably) accurately efficiently.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Load Balancing Approach for Parallel FP-Growth

Due to the exponential growth in worldwide information, companies have to deal with an ever growing amount of digital information. So the huge size of data and computation volume of new processing applications such as data mining, leads to new high performance parallel processing systems. One of the most important challenges of such application is quickly and correctly finding the relationship ...

متن کامل

LPAS: High Efficiency Load Balancing Parallel Data Mining Algorithm

Association rule discovery plays an important role in knowledge discovery and data mining, and efficiency is especially crucial for an algorithm finding frequent itemsets from a large database. Many methods have been proposed to solve this problem. In addition, parallel computing has been a popular trend, such as on cloud platform, grid system or multicore platform. In this paper, a high effici...

متن کامل

Load Balancing Approach Parallel Algorithm for Frequent Pattern Mining

Association rules mining from transaction-oriented databases is an important issue in data mining. Frequent pattern is crucial for association rules generation, time series analysis, classification, etc. There are two categories of algorithms that had been proposed, candidate set generate-and-test approach (Apriori-like) and Pattern growth approach. Many methods had been proposed to solve the a...

متن کامل

Dynamic load balancing for parallel traffic simulation

In this paper, we describe implementation of the Parallel Traffic Simulator which applies parallel algorithms and computer graphics in the field of traffic simulation. It simulates traffic behaviour at the individual vehicle level. Parallel environment used consists of a set of workstations connected in a network. Networked workstations make up a loosely coupled parallel computer architecture w...

متن کامل

On Choosing a Load-Balancing Algorithm for Parallel Systems with Temporal Constraints

A key point in parallel systems design is the way clients requests are forwarded and distributed among the servers, trying to obtain the maximum throughput from them or, in other words, the load-balancing policy. Although it is a largely studied theme, with well accepted solutions, the inclusion of temporal constraints, also denoted as deadlines in this work, to the requests brings new complexi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computers Communications & Control

سال: 2022

ISSN: ['1841-9844', '1841-9836']

DOI: https://doi.org/10.15837/ijccc.2022.4.4806